Model Deployment


Privacy-Aware Joint DNN Model Deployment and Partition Optimization for Delay-Efficient Collaborative Edge Inference

Cheng, Zhipeng, Xia, Xiaoyu, Wang, Hong, Liwang, Minghui, Chen, Ning, Fan, Xuwei, Wang, Xianbin

arXiv.org Artificial Intelligence

Edge inference (EI) is a key solution to the growing challenges of delayed response times, limited scalability, and privacy concerns in cloud-based Deep Neural Network (DNN) inference. However, deploying DNN models on resource-constrained edge devices introduces further challenges, including model storage limitations, dynamic service requests, and privacy risks. This paper proposes a novel framework for privacy-aware joint DNN model deployment and partition optimization that minimizes long-term average inference delay under resource and privacy constraints. Specifically, the problem is formulated as a joint optimization over model deployment, user-server association, and model partition strategies. To handle its NP-hardness and future uncertainties, a Lyapunov-based approach transforms the long-term optimization into a sequence of single-time-slot problems while preserving long-term performance guarantees. Additionally, a coalition formation game is proposed for edge server association, and a greedy algorithm is developed for model deployment within each coalition, allowing the problem to be solved efficiently. Extensive simulations show that the proposed algorithms effectively reduce inference delay while satisfying privacy constraints, outperforming baseline approaches across a range of scenarios.
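
To make the deployment step concrete, here is a minimal sketch of a greedy model-placement heuristic under a storage budget. It is not the authors' algorithm: the model names, storage costs, and per-model delay savings are invented, and the paper's actual greedy rule operates within each coalition under its own formulation.

```python
# Hypothetical sketch of a greedy DNN model deployment heuristic for one edge
# server: repeatedly deploy the model with the largest estimated delay saving
# per unit of storage until the storage budget is exhausted. All numbers and
# names are illustrative, not the paper's formulation.

def greedy_deploy(models, storage_budget, delay_saving):
    """models: dict name -> storage cost (MB);
    delay_saving: dict name -> estimated delay reduction (ms) if served at the edge."""
    deployed, used = [], 0.0
    # Rank candidate models by delay saving per unit of storage.
    ranked = sorted(models, key=lambda m: delay_saving[m] / models[m], reverse=True)
    for m in ranked:
        if used + models[m] <= storage_budget:
            deployed.append(m)
            used += models[m]
    return deployed

# Example usage with made-up numbers.
models = {"resnet18": 45.0, "yolov5s": 28.0, "bert-tiny": 60.0}   # MB
saving = {"resnet18": 120.0, "yolov5s": 95.0, "bert-tiny": 80.0}  # ms per request
print(greedy_deploy(models, storage_budget=100.0, delay_saving=saving))
```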


In-situ Self-optimization of Quantum Dot Emission for Lasers by Machine-Learning Assisted Epitaxy

Shen, Chao, Zhan, Wenkang, Pan, Shujie, Hao, Hongyue, Zhuo, Ning, Xin, Kaiyao, Cong, Hui, Xu, Chi, Xu, Bo, Ng, Tien Khee, Chen, Siming, Xue, Chunlai, Liu, Fengqi, Wang, Zhanguo, Zhao, Chao

arXiv.org Artificial Intelligence

Traditional methods for optimizing light source emissions rely on a time-consuming trial-and-error approach. While in-situ optimization of the emission of light source gain media during growth is ideal, it has yet to be realized. In this work, we integrate in-situ reflection high-energy electron diffraction (RHEED) with machine learning (ML) to correlate the surface reconstruction with the photoluminescence (PL) of InAs/GaAs quantum dots (QDs), which serve as the active region of lasers. A lightweight ResNet-GLAM model is employed for real-time processing of RHEED data, enabling effective identification of optical performance. This approach guides the dynamic optimization of growth parameters, allowing real-time feedback control to adjust the QD emission for lasers. We successfully optimized InAs QDs on GaAs substrates, with a 3.2-fold increase in PL intensity and a reduction in full width at half maximum (FWHM) from 36.69 meV to 28.17 meV under initially suboptimal growth conditions. Our automated, in-situ self-optimized lasers with 5-layer InAs QDs achieved electrically pumped continuous-wave operation at 1240 nm with a low threshold current density of 150 A/cm² at room temperature, a performance comparable to that of samples grown through traditional manual multi-parameter optimization. These results mark a significant step toward intelligent, low-cost, and reproducible production of light emitters.
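
The real-time classification loop can be sketched as follows. The ResNet-GLAM architecture is not specified in this abstract, so the snippet below substitutes an off-the-shelf ResNet backbone; the number of quality classes, the frame size, and the feedback rule are assumptions for illustration only.

```python
# Illustrative real-time RHEED-frame classification loop using a stock ResNet
# backbone as a stand-in for the paper's ResNet-GLAM model. Labels, input
# shape, and the control rule are hypothetical.
import torch
import torch.nn as nn
from torchvision.models import resnet18

NUM_CLASSES = 3  # hypothetical emission-quality classes: poor / fair / good

model = resnet18(weights=None)
model.fc = nn.Linear(model.fc.in_features, NUM_CLASSES)
model.eval()

def classify_rheed_frame(frame: torch.Tensor) -> int:
    """frame: (3, 224, 224) tensor built from a single RHEED image."""
    with torch.no_grad():
        logits = model(frame.unsqueeze(0))
    return int(logits.argmax(dim=1))

# Dummy call with a random frame; a growth controller could then adjust
# parameters (e.g. temperature setpoint) whenever the predicted class drops.
pred = classify_rheed_frame(torch.rand(3, 224, 224))
print("predicted quality class:", pred)
```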


VPI-Mlogs: A web-based machine learning solution for applications in petrophysics

Nguyen, Anh Tuan

arXiv.org Artificial Intelligence

Machine learning is an important part of the data science field, and in petrophysics, machine learning algorithms and applications have been widely explored. In this context, the Vietnam Petroleum Institute (VPI) has researched and deployed several effective prediction models, including missing log prediction and fracture zone and fracture density forecasting. As one of our solutions, VPI-MLogs is a web-based deployment platform that integrates data preprocessing, exploratory data analysis, visualisation, and model execution. Built in Python, the most popular data analysis programming language, it gives users a powerful tool for working with petrophysical log data and helps to narrow the gap between general data science knowledge and petrophysical insight. This article focuses on the web-based application, which integrates many of these solutions for handling petrophysical data.
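
As a rough illustration of the "missing log prediction" use case mentioned above, the sketch below trains a regressor to predict one petrophysical curve from others. The column names, file name, and model choice are assumptions; VPI-MLogs' actual pipeline is not described in this abstract.

```python
# Hypothetical missing-log prediction: estimate a sonic curve (DT) from other
# measured curves with a gradient-boosted regressor. All names are placeholders.
import pandas as pd
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.model_selection import train_test_split

df = pd.read_csv("well_logs.csv")          # hypothetical input file
features, target = ["GR", "RHOB", "NPHI", "RT"], "DT"

X_train, X_test, y_train, y_test = train_test_split(
    df[features], df[target], test_size=0.2, random_state=42)

model = GradientBoostingRegressor().fit(X_train, y_train)
print("R^2 on held-out data:", model.score(X_test, y_test))
```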


StraightLine: An End-to-End Resource-Aware Scheduler for Machine Learning Application Requests

Ching, Cheng-Wei, Guan, Boyuan, Xu, Hailu, Hu, Liting

arXiv.org Artificial Intelligence

The life cycle of machine learning (ML) applications consists of two stages: model development and model deployment. However, traditional ML systems (e.g., training-specific or inference-specific systems) focus on only one stage of this life cycle. They typically aim at optimizing model training or accelerating model inference, and they frequently assume homogeneous infrastructure, which may not reflect real-world scenarios spanning cloud data centers, local servers, containers, and serverless platforms. This paper presents StraightLine, an end-to-end resource-aware scheduler for ML application requests in such hybrid infrastructure. Its key innovation is an empirical dynamic placing algorithm that intelligently places requests based on their unique characteristics (e.g., request frequency, input data size, and data distribution). In contrast to existing ML systems, StraightLine offers end-to-end resource-aware placement and can thereby significantly reduce response time and failure rate for model deployment across heterogeneous computing resources in the hybrid infrastructure.
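
A toy placement rule in the spirit of such a dynamic placing algorithm is sketched below. The thresholds, backend names, and request features are invented for illustration; the paper's actual policy is empirical and more elaborate.

```python
# Hypothetical resource-aware placement: route an inference request to a backend
# based on its observed frequency and input size. Numbers and names are made up.
from dataclasses import dataclass

@dataclass
class Request:
    input_bytes: int
    recent_qps: float   # observed request frequency for this model

def place(req: Request) -> str:
    if req.recent_qps > 50:                 # hot model: keep a warm container
        return "container-pool"
    if req.input_bytes > 5_000_000:         # large payloads: local GPU server
        return "local-gpu-server"
    return "serverless"                     # sporadic, small requests

print(place(Request(input_bytes=120_000, recent_qps=2.0)))   # -> "serverless"
```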


Naming the Pain in Machine Learning-Enabled Systems Engineering

Kalinowski, Marcos, Mendez, Daniel, Giray, Görkem, Alves, Antonio Pedro Santos, Azevedo, Kelly, Escovedo, Tatiana, Villamizar, Hugo, Lopes, Helio, Baldassarre, Teresa, Wagner, Stefan, Biffl, Stefan, Musil, Jürgen, Felderer, Michael, Lavesson, Niklas, Gorschek, Tony

arXiv.org Artificial Intelligence

Context: Machine learning (ML)-enabled systems are being increasingly adopted by companies aiming to enhance their products and operational processes. Objective: This paper aims to deliver a comprehensive overview of the current status quo of engineering ML-enabled systems and lay the foundation to steer practically relevant and problem-driven academic research. Method: We conducted an international survey to collect insights from practitioners on the current practices and problems in engineering ML-enabled systems. We received 188 complete responses from 25 countries. We conducted quantitative statistical analyses on contemporary practices using bootstrapping with confidence intervals and qualitative analyses on the reported problems using open and axial coding procedures. Results: Our survey results reinforce and extend existing empirical evidence on engineering ML-enabled systems, providing additional insights into typical ML-enabled systems project contexts, the perceived relevance and complexity of ML life cycle phases, and current practices related to problem understanding, model deployment, and model monitoring. Furthermore, the qualitative analysis provides a detailed map of the problems practitioners face within each ML life cycle phase and the problems causing overall project failure. Conclusions: The results contribute to a better understanding of the status quo and problems in practical environments. We advocate for the further adaptation and dissemination of software engineering practices to enhance the engineering of ML-enabled systems.
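
For readers unfamiliar with the statistical method mentioned above, the following sketch shows a percentile bootstrap confidence interval for an adoption rate. The data are synthetic, not the survey's responses; only the sample size of 188 is taken from the abstract.

```python
# Percentile bootstrap CI for a proportion, on synthetic survey-style data.
import numpy as np

rng = np.random.default_rng(0)
answers = rng.integers(0, 2, size=188)     # 188 responses, 1 = "uses practice X"

def bootstrap_ci(x, n_boot=10_000, alpha=0.05):
    # Resample respondents with replacement and take percentiles of the means.
    stats = [rng.choice(x, size=len(x), replace=True).mean() for _ in range(n_boot)]
    return np.percentile(stats, [100 * alpha / 2, 100 * (1 - alpha / 2)])

print("95% CI for adoption rate:", bootstrap_ci(answers))
```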


The big tech firms want an AI monopoly – but the UK watchdog can bring them to heel John Naughton

The Guardian

"Monopoly," said Peter Thiel, Silicon Valley's answer to Darth Vader, "is the condition of every successful business." This aspiration is widely shared by Gamman, the new acronynm for the Valley's giants – Google, Apple, Microsoft, Meta, Amazon and Nvidia. And the arrival of AI has sharpened the appetite of each for attaining that blessed state before the others get there. One symptom of their anxiety is the way they have been throwing unconscionable amounts of money at the 70-odd generative AI startups that have mushroomed since it became clear that AI was going to be the new new thing. Microsoft reportedly put 13bn (about 10.4bn) into OpenAI, for example, but it was also the lead investor in a 1.3bn funding round for Inflection, Deepmind co-founder Mustafa Suleyman's startup.


Navigating Privacy and Copyright Challenges Across the Data Lifecycle of Generative AI

Zhang, Dawen, Xia, Boming, Liu, Yue, Xu, Xiwei, Hoang, Thong, Xing, Zhenchang, Staples, Mark, Lu, Qinghua, Zhu, Liming

arXiv.org Artificial Intelligence

The internet has enabled an unprecedented free flow and wide distribution of information on a global scale, accelerating the democratization of information and fueling platforms like Wikipedia, YouTube, and StackOverflow. At the same time, it has lowered barriers against unauthorized data use and piracy. The success of Deep Learning (DL) owes significantly to the availability of large-scale datasets for training DL models [3], predominantly sourced from the internet [4].


Introduction to ML Deployment: Flask, Docker & Locust

#artificialintelligence

You've spent a lot of time on EDA, carefully crafted your features, tuned your model for days and finally have something that performs well on the test set. Now, my friend, we need to deploy the model. After all, any model that stays in the notebook has a value of zero, regardless of how good it is. It might feel overwhelming to learn this part of the data science workflow, especially if you don't have a lot of software engineering experience. Fear not, this post's main purpose is to get you started by introducing one of the most popular frameworks for deployment in Python -- Flask.
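
A minimal Flask serving sketch along the lines the post introduces is shown below: load a pickled model and expose a /predict endpoint. The file name, feature layout, and port are placeholders, and the post's Docker and Locust steps are not shown.

```python
# Minimal Flask model-serving sketch: a pickled scikit-learn-style model behind
# a JSON /predict endpoint. "model.pkl" and the feature format are hypothetical.
import pickle
from flask import Flask, jsonify, request

app = Flask(__name__)
with open("model.pkl", "rb") as f:          # hypothetical trained model
    model = pickle.load(f)

@app.route("/predict", methods=["POST"])
def predict():
    features = request.get_json()["features"]   # e.g. [[5.1, 3.5, 1.4, 0.2]]
    preds = model.predict(features).tolist()
    return jsonify({"predictions": preds})

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=5000)
```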


Enabling MLOps in Three Simple Steps

#artificialintelligence

I recently engaged in a project implementing a multiclass classification prediction system on financial transactional data, comprising over 10 million records and more than 70 classes. Through this project, I constructed a streamlined end-to-end machine learning operations (MLOps) infrastructure that is well suited to this specific use case while maintaining cost efficiency. The term MLOps covers a broad range of concepts and definitions, depending on the vendor or solution. Some focus on aspects such as training traceability and experiment tracking, while others prioritise feature storage or model deployment. In my understanding, MLOps is the entire end-to-end process, from data extraction to model deployment and monitoring.


evoML Yellow Paper: Evolutionary AI and Optimisation Studio

Li, Lingbo, Kanthan, Leslie, Basios, Michail, Wu, Fan, Adham, Manal, Avagyan, Vitali, Butler, Alexis, Brookes, Paul, Giavrimis, Rafail, Liu, Buhong, Pavlou, Chrystalla, Truscott, Matthew, Voskanyan, Vardan

arXiv.org Artificial Intelligence

Machine learning model development and optimisation can be a cumbersome and resource-intensive process. Custom models are often difficult to build and deploy, and they require infrastructure and expertise that are costly to acquire and maintain. The machine learning product development lifecycle must therefore account for the difficulties of developing and deploying such models. evoML is an AI-powered tool that provides automated functionalities for machine learning model development, optimisation, and model code optimisation. Core functionalities of evoML include data cleaning, exploratory analysis, feature analysis and generation, model optimisation, model evaluation, model code optimisation, and model deployment. Additionally, a key feature of evoML is that it embeds code and model optimisation into the model development process and includes multi-objective optimisation capabilities.
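
The multi-objective idea mentioned above can be illustrated by a tiny Pareto-front selection over candidate models, trading predictive accuracy against inference latency. This is not evoML's implementation; the candidate models and their scores are invented.

```python
# Illustrative multi-objective model selection: keep candidates that are
# Pareto-optimal on (higher accuracy, lower latency). All values are made up.
candidates = [
    {"name": "logreg",  "accuracy": 0.86, "latency_ms": 0.4},
    {"name": "xgboost", "accuracy": 0.91, "latency_ms": 2.5},
    {"name": "mlp",     "accuracy": 0.90, "latency_ms": 3.1},
]

def pareto_front(models):
    """Keep models not dominated by another model on both objectives."""
    front = []
    for m in models:
        dominated = any(
            o["accuracy"] >= m["accuracy"] and o["latency_ms"] <= m["latency_ms"]
            and (o["accuracy"] > m["accuracy"] or o["latency_ms"] < m["latency_ms"])
            for o in models)
        if not dominated:
            front.append(m)
    return front

print([m["name"] for m in pareto_front(candidates)])  # mlp is dominated by xgboost
```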